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noise  and  quantization  errors.  We  will  show  that  qualitative  depth  information  can  be 
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(or  computation)  of  camera  parameters.  The  only  constraint  is  that  the  epipolar  plane  of 
the  fixation  point  includes  the  A— axes  of  both  cameras.  We  derive  two  expressions  that 
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have  been  observed  in  psychophysical  experiments,  most  notably  the  “induced  size  effect” . 
Furthermore,  the  same  approach  can  be  applied  to  estimate  some  qualitative  behavior  of 
the  normal  to  the  surface  of  any  object  in  the  field  of  view.  More  specifically,  one  can 
follow  changes  in  the  curvature  of  a  contour  on  the  surface  of  an  object,  with  either  x-  or 
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Qualitative  Depth  and  Shape  from  Stereo 


Introduction 


Research  in  early  vision  regarding  stereo  seems  to  be  concerned  mainly  with  the  correspon¬ 
dence  problem,  namely,  finding  the  right  matching  between  points  on  the  left  and  right 
images.  Obtaining  exact  depth  values  from  a  stereo  pair  has  been  considered  a  simple  exer¬ 
cise,  whose  solution  is  well  known,  though  might  involve  some  tedious  but  straightforward 
computations.  Thus,  it  has  been  implicitly  assumed  that  the  final  goal  of  stereo  algorithms  is 
to  compute  an  exact  depth  map  using  disparity  values.  The  following  observations  suggest, 
however,  that  the  depth  computation  from  disparity  values  is  not  necessarily  straightforward 
or  even  feasible,  and  that  more  qualitative  depth  information  may  be  easier  to  obtain  and 
more  robust. 

First,  the  depth  computation  problem  reduces  to  a  simple  trigonometric  formula  when 
the  parameters  of  the  cameras,  or  the  eyes,  are  known.  When  they  are  not  known,  a  scheme 
to  compute  the  camera’s  parameters  from  a  number  of  conjugate  points  (that  is,  matched 
pairs  of  points  from  the  different  images)  has  been  devised,  involving  the  solution  of  a  set 
of  nonlinear  equations  (see  for  instance  Horn,  1986).  S;nce  the  problem  has  no  closed-form 
solution,  and  since  the  data  are  not  precise,  a  solution  is  found  using  iterative  methods  that 
minimize  the  sum  of  the  squares  of  the  errors.  In  practice,  however,  this  approach  is  very 
difficult  to  implement,  since  the  parameters  of  the  cameras  must  be  obtained  from  data  with 
error  on  the  order  of  magnitude  of  the  disparity  values,  which  are  the  raw  material  used  for 
depth  computation  (e.g.,  error  due  to  pixel  quantization).  In  other  words,  the  registration 
problem  (namely,  finding  parameters  for  the  camera’s  calibration)  is  much  more  difficult 
than  just  computing  depth  from  disparity  values.  Less  general  methods  to  perform  camera 
calibration  have  also  been  devised,  see  Prazdny  (1981)  and  Longuet-Higgins  (1981). 

The  other  observation  originates  from  biological  vision.  It  seems  that  human  vision  does 
not  necessarily  obtain  exact  depth  values  from  stereo  disparity  information  alone,  see,  e.g., 
Foley  (1977)  and  Foley  &  Richards  (1972).  Rather,  stereo  disparity  seems  to  be  used  mainly 
in  obtaining  qualitative  depth  information  about  objects  in  the  field  of  view.  Estimation  of 
the  magnitude  of  this  relative  depth  is  possibly  dependent  on  extraretinal  estimation  of  some 
physical  parameters  like  the  angle  of  convergence  of  the  eyes.  For  example,  whether  looking 
at  stereograms  with  crossed  or  uncrossed  eyes  affects  only  the  extraretinal  perception  of 
the  angle  of  convergence  of  the  eyes,  not  the  disparity  values.  It  also  results  in  a  different 
perception  of  the  depth  of  the  central  square  in  a  simple  random-dot  stereogram  (where  a 
central  square  in  one  image  has  a  constant  shift  with  respect  to  the  other). 

The  purpose  of  this  paper  is  to  exploit  the  geometry  of  the  situation,  where  a  scene  is 
viewed  from  two  different  angles,  to  obtain  insight  into  the  above  problem.  It  will  be  shown 
that  qualitative  relative  depth  information  (order)  of  various  kinds  can  be  obtained  from 
conjugate  points  alone  easily  and  reliably,  involving  almost  no  computations  and  indepen¬ 
dent  of  the  camera’s  parameters.  These  orders  will  demonstrate  some  anomalies  that  are 
observed  in  human  psychophysics  and  presently  lack  other  straightforward  explanations. 
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One  anomaly  is  the  induced  size  effect,  where  a  distortion  of  one  image  by  stretching  the 
Y -axis  (the  vertical  axis)  of  that  image  produces  a  tilt-impression  similar  to  that  produced  by 
stretching  the  X-axis  (the  horizontal  axis)  of  the  other  image  by  the  same  amount.  Whereas 
the  tilt  impression  caused  by  stretching  the  X-axis  has  a  simple  geometrical  explanation,  the 
reversed  tilt  impression  caused  by  stretching  the  Y -axis  has  no  geometrical  logic  behind  it, 
and  has  therefore  been  called  an  induced  effect.  Induced,  since  it.  is  as  if  the  unrealistically 
magnified  F-axis  induces  the  shrinking  of  the  X-  and  F-axis  as  a  compensation,  which  has 
the  similar  geometrical  interpretation  as  stretching  the  A' -axis  of  the  other  image.  This 
effect,  first  reported  by  Ogle  (1938),  has  stimulated  extensive  research,  see  Arditi  et  al. 
(1981),  Mayhew  (1982),  Longuct-Higgins  (1982),  Mayhew  Longuet-Higgins  (1982),  and 
Rogers  &  Koenderink  (1986). 

It  will  also  be  shown  that  some  qualitative  shape  information  can  be  obtained  from 
image  coordinates  only.  More  specifically,  one  can  follow  changes  in  the  curvature  of  a 
contour  on  the  surface  of  any  object  in  the  field  of  view,  with  either  x-  or  y-coordinate  fixed. 

The  exact  order  expressions  will  scale  in  proportion  to  the  angle  of  convergence  between 
the  two  cameras.  The  exact  relative  depth  can  be  computed  from  these  orders  using  few 
matched  points  and  some  approximated  numeric  scheme,  or  using  more  than  two  images. 
Alternatively,  it  can  be  estimated  from  some  external  estimation  of  the  physical  quantities 
involved,  namely  the  angle  of  convergence  and  the  angle  of  gaze,  in  agreement  with  the 
psychophysical  theory  suggested  in  Foley  (1977). 

Basic  Geometry 

Given  two  cameras,  assume  that  the  optical  axes  intersect  at  the  fixation  point.  Also, 
assume  that  the  epipolar  plane  of  the  fixation  point  (the  plane  through  the  optical  axes 
of  both  cameras  and  their  baseirae,  henceforth  “base  plane”)  includes  the  X-axes  of  both 
cameras  (which  are,  therefore,  epipolar  lines  by  definition).  Let  us  define  the  following 
coordinate  system  (see  figure  1):  let  the  fixation  point  be  the  origin,  the  base  plane  (which 
passes  through  this  point)  be  the  X  —  Z  plane,  and  the  line  perpendicular  to  this  plane 
through  the  origin  be  the  F-axis.  On  the  X  —  Z  plane,  the  optical  axes  of  both  cameras 
intersect  in  the  origin  and  create  an  angle  2/i  between  them.  Let  the  Z-axis  be  the  angle- 
bisector  of  2/r,  and  the  X-axis  perpendicular  to  the  Z-axis  in  the  X  —  Z  plane.  This  system 
is  very  similar  to  the  cyclopean  coordinate  system  commonly  used  in  the  literature,  with 
the  exceptions  that  the  angle-bisector  is  replaced  by  the  median  to  the  interocular  line  and 
the  origin  is  translated  to  the  mid  point  of  the  intcrocular  line.  A  similar  system  can  be 
defined  for  the  case  of  motion,  if  the  fixation  point  is  kept  constant.  That  is,  the  observer 
follows  the  same  point  with  his  eyes.  This  is  more  typical  of  human  vision  than  machine 
vision. 

For  a  given  point  P  =  (r,y,r),  let  o  denote  the  angle  of  tilt  -  the  polar  angle  of  its 
projection  on  the  X  —  Z  plane  (cv  =  arctan(|-)).  Let  /?  denote  the  angle  of  slant  -  the 
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Figure  1.  The  base  plane  (X  —  Z )  viewed  from  above,  with  two  cameras. 


polar  angle  of  its  projection  on  the  Y-Z  plane  (/?  =  arctan(|)).  Thus  P  can  be  also  written 
as  P  =  ( tana’  tan"/?1  where  z  is  its  depth  relative  to  the  fixation  point  in  the  above 
coordinate  system.  Let  (xj,t/()  and  (xr,yr)  be  the  Cartesian  coordinates  of  the  projection 
of  P  on  the  left  and  right  images  respectively.  Using  polar  coordinates,  the  two  projections 
can  be  written  as  (rj,i?j)  and  (rr,$r)  respectively.  Let  A  =  .  Then  the  following  can 

be  shown  to  hold  (see  appendix): 

tana  =  - — — -  ’  (1) 

tan  fj,  A  +  1 

cot  dr  -  cot  i9,  . 

tanp  =  - — -  (■*) 

2sm/i 

Thus,  the  two  angles  a  and  /?  depend  only  on  the  angle  of  convergence  and  the  polar 


tan/?  = 


[  *4%  ,'4^  *.(  *.k  1,1  v.l  1 
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angles  of  the  conjugate  points.  It  can  be  shown  that  the  polar  angles  are  preserved  under 
projection,  through  any  point  on  the  optical  axis,  onto  either  a  spherical  body  (like  the  eye) 
or  a  planar  one  (a  camera).  There  is  no  dependence  on  other  parameters  of  the  cameras 
(which  could  be  different  ones),  their  relative  positions,  or  the  angle  of  gaze.  Equation  (1) 
will  be  used  in  the  next  section  to  obtain  an  order  on  all  matched  image  points  in  each 
visual  hemifield  according  to  their  tilt.  This  order  is  independent  of  the  camera  parameters 
and  demonstrates  psychophysical  anomalies  like  the  induced  effect  and  others.  Equation  (2) 
will  be  used  to  obtain  an  expression  for  the  relative  depth  2.  However,  this  expression  will 
depend  on  the  camera’s  parameters  like  focal  length  and  interocular  distance,  and  the  angle 
of  gaze.  A  parameter-independent  relative  depth  order  can  be  obtained  from  this  expression. 
It  will  hold  for  small  angles  of  convergence  2/x,  as  will  be  discussed  in  a  succeeding  section. 

Tilt-related  Order 

From  (1)  it  immediately  follows  that  a  is  a  monotonic  increasing  function  of  A  for  a  fixed 
configuration  of  the  cameras.  Thus,  the  ratio 

A^  coti?‘  cott?£ 

A j  cot  t9j  Cot  1 9j 

gives  qualitative  relative  distance  information  on  any  two  points  i  and  j  in  each  visual 
hemifield  in  the  following  sense:  if  the  ratio  is  greater  than  1,  meaning  a-,  >  <a-;,  then  a 
separating  plane  between  points  i  and  j  through  the  fixation  point  and  perpendicular  to 
the  base  plane  will  leave  point  j  and  the  viewer  on  one  of  its  sides,  and  point  i  on  the  other 
side  “further  away”  from  the  viewer.  In  other  words,  A  defines  an  order  on  all  the  matched 
points  in  a  given  hemifield.  This  order  corresponds  to  Euclidean  distance  if  points  i  and  j 
are  approximately  on  the  same  line  of  sight  from  the  viewer,  namely  about  the  same  image 
x-coordinate. 

Note  that  if  A  is  constant  on  all  points  segmented  as  belonging  to  the  central  object, 
then  it  is  a  planar  object  with  some  tilt  towards  or  away  from  the  viewer,  according  to  the 
sign  of  A  —  1.  Moreover,  since 


coti?r  > 


cott 9 1 


i  1  a  =  0°, 


it  follows  that  —  1 9r  also  gives  a  qualitative  estimation  to  the  tilt  of  a  point  P  relative 
to  the  fixation  point.  If  the  fixation  point  is  at  the  same  distance  from  both  cameras,  this 
estimate  would  indicate  whether  P,  relative  to  the  fixation  point,  is  tilted  awayr  from  the 
camera’s  baseline  (a  >  0°),  “parallel”  to  the  baseline  (a  ~  0°),  or  towards  the  camera’s 
baseline  (a  <  0°). 

The  order  expression  A  has  been  defined  as  a  function  of  the  polar  angle  d  only  in  both 
eyes.  This  is  especially  convenient  since  the  polar  angle  is  preserved  under  projection  onto 
either  a  spherical  body  (the  eye)  or  a  planar  body  (a  camera).  However,  it  might  prove 


|g 

US: 

u 

m 

If 

'  "  V  .  -  • 


vx-x^x*:  ov-x-' 


I 


Qualitative  Depth  and  Shape  from  Stereo 


Weinshall 


useful  to  examine  A  as  a  function  of  the  Cartesian  coordinates  (x/,y/)  and  (xr,yr)  in  both 
images,  assuming  planar  projection.  In  this  case: 


In  A  =  In 


cot  i9r 


xr/xi 


COt  'O  l  Vr/yi 

(4) 

=  (lnxr  -  lnx/)  -  (lnyr  ~  lny/) 

=  A(lnx)  —  A(ln  y). 

In  other  words,  if  any  matching  algorithm  is  applied  to  the  output  images  of  the  transfor¬ 
mation  T  :  (x,y)  — ►  (In x, lny)  performed  on  the  original  images,  and  the  disparity  vector 
(Ax,  Ay)  is  then  computed  in  the  usual  way,  then  the  difference  Ax  —  Ay  =  In  A  is  an  order 
of  the  same  type  as  A,  with  no  need  for  any  additional  computation.  The  transformation 
T  is  indeed  singular  near  the  vertical  and  horizontal  meridians.  However,  there  is  evidence 
from  neurophysiology  for  the  existence  of  a  related  transformation  between  the  retina  and 
VI,  the  complex  log,  which  is  singular  at  the  origin  (see  Schwartz,  1984). 

One  prediction  of  the  order  A  is  the  “induced  effect”,  the  psychophysical  effect  where  a 
distortion  of  one  image  by  stretching  the  Y -axis  (the  vertical  axis)  of  that  image  produces  a 
tilt  impression  similar  to  that  produced  by  stretching  the  A" -axis  (the  horizontal  axis)  of  the 
other  image  by  the  same  amount  (see  introduction).  Evidently,  the  decision  rule  specified 
above  exhibits  the  same  qualitative  behavior  since  it  involves  only  terms  of  the  form 
Hence  multiplying  the  F-axis  by  some  number  has  the  same  effect  as  multiplying  the  X- 
axis  by  its  inverse.  Thus  an  induced  effect  is  a  natural  side-effect  of  using  the  relation  , 
which  does  not  depend  on  any  assumptions  and  approximations,  or  the  complete  recovery 
of  all  depth-related  parameters  of  the  scene.  On  the  other  hand,  some  other  computational 
explanations  to  this  effect,  e.g.  Mayhew  &  Longuet-Higgins  (1982)  and  Mayhew  (1982), 
obtain  the  induced  effect  as  a  by-product  of  a  specific  approximation  scheme  and  a  tedious 
numerical  computation;  it  does  not  result  from  an  exact  solution  of  the  disparity  equations. 
Another  computational  explanation  (Arditi  et  al.,  1981)  suggests  that  a  distortion  occurs 
in  the  matching  stage,  assuming  matching  is  done  along  horizontal  lines  only. 

Motion  also  shows  an  illusion  similar  to  the  induced  effect  (see  Rogers  Sz  Koenderink, 
1986).  In  this  case,  observers  reported  that  a  fronto-parallel  surface  appeared  to  be  tilting 
in  depth  with  the  right-hand  side  apparently  closer  than  the  left  when  the  monocular  image 
was  progressively  magnified  with  head  movement  to  the  right  and  vice  versa.  A  can  account 
for  this  phenomenon.  Moreover,  in  this  case  there  is  an  additional  effect  -  a  perceived 
forward/backward  motion.  This  could  possibly  be  accounted  for  by  the  angle  of  gaze  v.  As 
will  be  shown  later, 


tan  v  = 


tan// 


1  _  ur(r=0) 

1  I  Mr=0)  ' 

'  !/i(z=0) 


Thus,  a  distortion  of  the  y-axis  in  one  image  will  distort  ;/  (a  distortion  of  the  x-axis  will 
not  affect  v,  however).  The  angle  of  gaze  v  can  be  used  to  obtain  the  direction  of  motion 
in  the  following  way  (see  figure  2):  v  is  0  (the  true  angle  of  gaze)  for  motion  from  point 


.  *4r. 


Al.*! »*I*V  AWi 
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1  to  point  2.  Positive  i/,  the  computed  angle  of  gaze,  implies  motion  from  1  to  2',  that  is, 
backward  movement  of  the  head  in  addition  to  its  left  to  right  movement.  Since  the  head 
does  not  move,  the  object  is  perceived  as  moving  backwards. 


Figur°  o  Fa|Sfc  positive  angle  of  gaze  induces  perception  of  backward  motion  (point  2'), 
whereas  the  true  value  0  shows  no  such  motion  (point  2),  see  text. 


The  quantitative  aspect  of  this  order,  namely,  obtaining  numerical  values  for  a  and  p  from 
A,  will  be  discussed  in  a  succeeding  section.  However,  since  computing  A  involves  computing 
ratios  of  the  image’s  a: -coordinates  and  y-coordinates,  one  should  expect  problems  near  the 
meridians.  The  horizontal  meridian  is  of  special  interest  since  the  matching  algorithm  should 
give  relatively  good  results  on  this  meridian.  In  this  respect,  it  is  interesting  to  note  that 
human  observers  also  demonstrate  deteriorating  performance  near  the  meridians,  especially 
near  the  horizontal  meridian  ( Arditi,  1982).  This  deterioration  is  demonstrated  by  a  smaller 
probability  for  a  correct  detection  of  the  tilt  of  an  oblique  line  when  either  the  x-  or  y-axis 
is  magnified,  and  when  the  angle  between  the  oblique  line  and  the  horizontal  meridian  is 
around  either  0°  or  90°.  Compensation  for  errors  near  the  horizontal  meridian  is  further 
discussed  in  the  section  on  error  analysis. 
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Depth-related  Order 


From  equation  (2)  one  can  obtain  an  explicit  expression  for  the  depth  z  of  a  point  relative 
to  the  fixation  point  (the  origin).  First,  note  that  (2)  implies 

z  =  (coti9r  —  cott?/)  •  — - — .  (5) 

2  sin  fi 

Thus,  xy  —  (coti?r  —  cot i9;)  gives  exact  depth  order  on  all  the  points  in  space  with  some 
constant  height  y  over  the  base  plane.  It  follows  that  this  order  is  most  useful  to  compare 
points  that  differ  mainly  in  their  x-coordinate  with  y  approximately  the  same.  The  previous 
order  A,  on  the  other  hand,  was  most  useful  to  compare  points  that  differ  mainly  in  their 
y-coordinate  with  x  approximately  the  same.  This  fact  stands  in  agreement  with  much 
psychophysical  evidence  on  the  anisotropy  between  the  vertical  and  horizontal  dimensions. 


Next,  let  us  derive  an  expression  for  z  that  depends  only  on  scene  and  camera  param¬ 
eters.  In  the  appendix  it  is  shown  that,  for  v  the  angle  of  gaze,  I  the  interocular  distance, 
and  h  the  focal  length  of  the  cameras, 


or 


y  =  { 
y={L 


I  COs(fi  —  v) 
sin2/x 

cos  (/i  +  v) 
sin2y 


—  x  sin  y,  +  z  cos  fi  >  •  — , 

h 


+  x  sin  fi  +  z  cos  fi 


\  vi_ 

r h  ■ 


(6) 


(6') 


Substituting  (6)  in  (5)  for  a  point  in  the  right  hemifield,  ((6')  will  be  used  otherwise),  gives: 

I  cos( fi  —  v)j  sin  2 fi 


z  = 


{  i2h  +  tanMxr  +  J-x/)]-cos/u} 


Thus,  for  an  angle  of  convergence  2 fi  small  enough  so  that  2h  |  tan  fi(xr  -f  ^x;)|,  we 
obtain  a  relative  depth  order  x  on  all  the  points  in  the  visual  field,  where 


Vr 

X  =  xr - XI- 

yi 


As  will  be  shown  in  the  section  on  error  analysis,  =  1  4-  0(fi).  Likewise,  since  the  field 
of  view  is  mechanically  bounded  by  some  2£  <  180°,  it  follows  that  x  <  /?  tan£.  Thus,  a 
sufficient  condition  for  the  appropi lateness  of  to  a  first  order  in  fi,  is  1  ^  tan  fi  ■  tan  if.  If 
2£  <  90°,  which  is  a  reasonable  upper  bound,  then  it  is  sufficient  if  1  tan  fi,  or  2 fi  <C  90°. 
To  illustrate,  the  distance  to  the  point  of  fixation  should  be  much  greater  than  3  cm  for  an 
average  person  looking  straight  ahead,  possibly  30  cm  or  more. 


We  have  obtained,  then,  a  relative  depth  order  that  is  the  traditional  x-disparity  cor¬ 
rected  for  non  zero  vergence  (angle  of  gaze  v  not  0)  and  some  field  location  (x-coordinate) 
distortion.  However,  for  a  fixed  convergence  angle  2ft,  this  order  has  some  distortion  relative 
to  the  physical  relative  depth,  which  increases  with  the  horizontal  distance  from  the  point 
of  fixation  (the  x-coordinate). 


1 
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Qualitative  Shape  from  Stereo 

The  triple  (a,  ft,  z)  as  a  point  representation,  and  equations  (1)  and  (2),  turn  out  to  he  useful 
for  surface  normal  analysis.  For  any  two  points  P\  and  P2,  where  P,  =  z  \  (  --1  -  - ,  ,  1  ) 

and  P2  =  tlifiJT’  *)’  let  N  =  Pi  x  P2.  N  is  perpendicular  to  (Pj  -  P2).  (It  is 

actually  proportional  to  the  normal  to  the  plane  passing  through  P\ ,  P2,  and  the  fixation 
point.)  After  some  calculations,  it  can  he  shown  that 

-t  cot  (3\  —  cot  /?2  COta2  —  COtO] 


N  —  z( 


cot  a  j  cot  —  cot  e*2  cot  Pi  '  cot  cv  j  cot  ~  cot  a2  cot 


=  z(-}—f(0l4lr,0lK)  ,  -^—gWK,0l02r)  ,  1), 

tan  p  sm  y 


where 


f(V}X,#l#2r)  = 


(cot  —  cot  i!)J )  —  (cot  i ?2  —  cot »9*) 


r’  r  (cot  t92  —  cot  i9|.)  -f  (cot  i92  —  cot  ) 

n(,P  ,9*  i92  ,92t  -  COt^COt^1  -COt792COtd;. 

l'  r1  1  '  r  (cot#2  —  cot i9J.)  +  (cot#2  —  cot#} ) 

Thus,  as  long  as  /(#} ,  t9 J. ,  #2,  if2)  and  y(#} ,  #*,  #2,  #2)  remain  constant,  which  can  be  deter¬ 
mined  from  image  coordinates  only,  the  points  are  coplanar  (among  themselves  and  with 
the  fixation  point),  or  the  object  at  the  center  of  gaze  is  planar.  Note  that  A  is  obtained 
from  /  when  cot#2  =  cot  #2  =  0  (g  —  0  then). 

Moreover,  for  any  object  it  is  possible  to  obtain  qualitative  information  about  its  surface 
along  any  contour,  with  either  r  or  y  fixed.  Take  a  contour  on  the  surface  with  some  fixed 
y-coordinate,  and  let  P]  and  P2  be  two  points  on  it.  Since  the  y-coordinate  of  P,  —  P2  is  0, 
the  projection  of  N  on  the  X  —  Z  plane,  n  —  z{  /(#} ,  #}.,  #2,  #2 )  ,  1),  is  perpendicular  to 

the  projection  of  P(  —  P2.  Thus,  for  fixed  y.  the  one  dimensional  boundary  contour  is  convex 
when  /( #J,#}.,#2,#2)  increases  with  increasing  x,  concave  when  /(#} ,  #}. ,  #2,  #2)  decreases, 
and  linear  when  /(#} ,  #).,  #2,  #2)  remains  constant.  Note  that  \y  can  be  obtained  from  / 
since  the  sign  of  /  determines  relative  depth  between  twa>  points  with  fixed  y-coordinate. 
The  same  qualitative  description  can  be  obtained  for  any  boundary  contour  with  fixed  r  from 
following  y(#},#}.,#2,#2)  with  increasing  y,  x  fixed.  This  qualitative  description  depends 
on  image  coordinates  only,  more  specifically  on  polar  angles  of  the  conjugate  points.  Thus  it 
predicts  an  “induced  effect”  for  planes  that  do  not  include  the  point  of  fixation.  Obtaining 
this  description  is  not  trivial,  though,  since  such  a  contour  in  the  world  coordinate  system 
will  be  usually  mapped  to  an  oblique  line  in  the  image  plane  due  to  convergence. 

In  the  general  case,  the  normal  to  a  plane  passing  through  three  points  in  space  P] ,  P2. 
and  P3  depends  on  the  image  coordinates  and  the  angle  of  convergence  y  in  a  more  complex 
way,  so  that  y  should  be  known  to  compute  the  (exact)  3-D  normal.  (The  focal  length  h 
should  be Renown  as  well.)  However,  if  Px  —  P2  =  (0,y,c)  and  P2  -  P3  =  (x,0,z)  or  vice 
versa,  the  normal  can  be  computed  from  the  above  argument  (up  to  a  scaling  factor  of  the 
x-  and  y-coordinates,  depending  on  y). 
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Alternatively,  one  can  estimate  the  normal  to  the  plane  passing  between  three  points 
P\ ,  P2,  and  P3  to  a  first  order  in  p  and  the  x-disparity 


~  g-*/ 


2 h  sin  p 

In  this  case,  after  substituting  K^h  •  (xr  —  g x/)  as  an  approximation  for  z,  where  K,ivi,  is 
some  constant  that  depends  on  p,  v,  and  h,  one  gets  an  expression  for  the  general  normal 
Ng- 

Ng  =(A  -  P2)  x  (P2  -  P3) 

x},  yi,  y} ,  x2,  zf ,  y2,  y2,  x®,  x?,  yj?,  y,3), 

U»vkG{x\,x\  ,y\,y]  txItx'I  1), 

where  and  are  some  constants  which  depend  on  p,  n  and  h.  F()  and 

G{ )  are  some  functions  of  images  coordinates  only.  Once  again,  one  can  verify  planarity  of 
surfaces  of  objects  in  the  field  of  view  when  F  and  G  remain  constant.. 


Numerical  Computation 

Let  us  compute  the  exact  tilt  and  depth  value  to  a  first  order  in  the  convergence  angle  2 p . 
following  Mayhew  &:  Longuet-Higgins’  (1982)  method  to  compute  tilt  and  slant  of  a  plane 
through  the  fixation  point.  The  following  scheme,  however,  will  be  simpler  and  involve  less 
and  more  rigorous  assumptions  (we  shall  only  assume  small  2p  as  implied  above).  Since,  to 
a  first  order  in  p,  tan2p  ~  !-ca^ ,  where  R  is  the  distance  between  the  fixation  point  and 
the  midpoint  of  the  interocular  line  (the  nose),  our  computations  will  be  to  a  first  order  in 

(*)• 

Let  (x,y)  and  (x',y')  denote  the  image  coordinates  of  a  certain  point  in  space  on 
the  two  cameras,  respectively.  Let  a  and  [i  denote  the  parameters  of  a  plane  that  passes 
through  a  given  point  in  space  and  the  fixation  point  in  the  above  coordinate  system,  so 
that  Z  =  aX  +  (3Y .  Thus  a  is  tan(a)  in  the  previous  notations  if  /3  —  0  and  (3  is  ta n(/3)  if 
a  =  0. 

Then,  to  a  first  order  in  p,  we  have  (Longuet-Higgins  &  Prazdny,  1980) 

Ax  =  x'  —  x  =[(d  cos(n)  +  sin(n))x  +  3  cos(i/)y 

+  (cos(n)  —  asin(i/))x2  —  0  sin(i')xy)  ■  I/R , 

(7) 

Ay  —  y'  —  y  =[sm(u)y  -f  (cos(n)  —  a sm(i>))xy 
—  /3sin(n)y2]  ■  I/R. 

(The  coordinate  system  used  to  obtain  (7)  is  the  cyclopean  coordinate  system.  This,  how¬ 
ever,  does  not  change  the  results  when  changing  to  our  coordinate  system  since  the  angle- 
bisector  and  the  median  are  the  same  line  to  a  first  order  in  p  and  the  translation  of  the 
origin  has  been  taken  into  account  in  the  definition  of  the  target  plane.) 
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For  a  given  point  in  space,  one  can,  in  the  more  interesting  cases,  take  the  plane  passing 
through  it  and  the  fixation  point  which  is  perpendicular  to  the  base  plane,  for  which  0  =  0. 
This  plane  would  be  determined  only  by  a  (the  plane  perpendicular  to  the  base  plane,  which 
is  usually  unique).  Thus,  we  have 


A  if 

- =  fsin(i/)  +  (cos(i')  —  dsin(i/))x]  ■  I/R 

y  k 


(8) 


=  [tan(^)  +  (1  —  drtan(?'))x]  •  tan(2/i). 

Let  (  xi  ,  t/i  ,  Axi  ,  Ayi  )  be  the  coordinates  of  a  point  on  the  vertical  meridian,  so  that 

Xi 


0.  Then  we  have 


— —  =  tan(n)  •  tan(2/i). 

y  i 


(Recall  that  — j-f-  =  tan (v)  ■  tan(/u)  always.) 


i+ 


Let  (  X2  ,  t/2  >  Ax2  ,  At/2  )  be  the  coordinates  of  a  point  with  osO.  Such  a  point,  if  it 

i  i 

exists,  can  be  easily  identified  since  it  satisfies  ^  ~  t--  Then  we  have: 

x2  •  tan(2/i)  =^1~  tan(iz)  •  tan(2/i)  = 

V2  y2  y\  y2  y\ 


In  other  words, 


tan(2/i)  =  — - 
x2 


r  v±  _  yj. 
y2  yi  ' 


Now,  for  any  point  (x,y)  in  the  image  we  have,  using  (7)  with  0  =  0: 

i 


x_  _  y_ 

x  y 

This  leads  to  the  final  equations: 


A  T  A7/ 

- =  dr  ■  Icos(t/)/R  =  a  ■  tan(2/i). 

x  y 


1 


tan(2f/)  =  — -[^i-  ^-j. 

*2  y  2  y  1 


(9) 


and: 


a  = 


5-  _  JL 
*  y 


tan(i/)  = 


Ayi 

yi 


R  = 


tan(2/i)  ’  tan(2/i)  ’  yW^)  +  Asi2 ' 

The  ratio  near  the  vertical  meridian  is  relatively  reliable  and  easy  to  obtain.  However, 
a  point  with  d  0  does  not  necessarily  exist,  in  which  case  we  can: 

1.  Follow  Mayhew  &  Longuet-Higgins  (1982)  and  neglect  the  term  atan(t'),  but  not  the 
term  tan(i'), 


tan(2^)  =  -  •  [V-  -  Hi], 

*  y  y  1 

If  we  neglect  tan(^),  for  consistency,  we  obtain: 

/0  .  1  A xj 

tan(2/x)  =  —  •  - . 

x  y 


(9') 


(9  ) 


v» 


A' 


ft 

Co 


1 

!b 

SC; 


<V" 
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2.  Solve  the  initial  scheme  without  such  a  point.  Given  a  vertical-meridian  point,  there 
remains  a  fairly  simple  equation  to  solve.  This  would  be: 

.1 


Vi  x  y  yi 

tan(2/x)  •  a  =  — - —  , 

x  y 


(10) 


which  reduces,  after  substituting  tan(2/x)  from  the  second  equation  in  the  first  equation, 
to  a  second  degree  polynomial  in  d. 

A  different  numerical  approach  would  be  to  use,  for  example,  three  images  taken  while 
moving  on  the  base  plane.  Denote  by  (xo,yo)i  (xi,yi)  and  2 t  J/2 )  the  coordinates  of  the 
conjugate  projections  of  some  point  P  on  the  three  images.  Denote  by  ai,  Mi  and  uL 
the  angle  of  tilt,  half  the  angle  of  convergence,  and  the  angle  of  gaze,  respectively,  in  the 
coordinate  system  defined  as  above  by  the  first  two  images.  Denote  by  a2 ,  M2  and  v2  the 
same  angles  in  the  coordinate  system  defined  by  the  last  two  images  (see  figure  3).  For 
motion  on  a  straight  line  we  have: 


tanoj  •  tan/xi  = 


A(xo,yo,xi,yi)  —  1 

A(xq,  yo,  xiyyi)  d- 1 


tan  V\  •  tan  /xj  = 


1  _  yi(*=°) 

yo(x=Q ) 
•j  I  yt(*=Q) 
^  yo(*=o) 


tan  o2  •  tan  M2 


tan  i>2  •  tan  /x2 


A(a?i,yi,  J2,ya)  - 1 
A(xi,yi,x2,y2)  +  1 


1  _  ya(i=o) 
yi(x=o) 

1  ,  »2(l=0) 

1+ 


—  <*2  =  Ml  +  M2 
v2  —  V\  =  Ml  +  M2  > 

where  yy! is  the  F-axis  ratio  on  the  vertical  meridian  (x  =  0)  between  conjugate  points 
in  images  i  and  j. 

Thus  we  have  six  nonlinear  equations  with  six  unknowns.  For  small  p,’s  we  have 
approximately  a  linear  problem,  where  the  solution  is  a  null  vector  of  the  approximating 
matrix.  We  will  obtain  a  similar  set  of  equations  if  we  take  two  points  in  the  three  images 
and  ignore  the  equations  involving  the  angle  of  gaze  v.  In  this  case  the  motion  in  the  base 
plane  does  not  have  to  be  in  a  straight  line. 


Error  Analysis 

First,  from  the  definition  of  A  and  x  if  follows  that  the  base  plane  itself  is  singular  in 
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Figure  3.  Three  images  from  motion  on  the  base  plane. 

the  sense  that  these  orders  are  not  defined  for  points  on  it.  The  same  problem  exists  in 
the  analysis  of  normals  to  surfaces  of  objects.  One  can,  however,  estimate  the  orders  and 
normals  by  substituting  of  a  matched  point  far  from  the  base  plane.  More  specifically, 
for  P  =  (x,y,  z)  we  have 

yr  di  z  2  sin  p  tan  v  x  2  sin  p  x  z 


1  +  tan  p  tan  v  dr  1  +  tan  p  tan  v  dr  1  +  tan  p  tan  v  dr  dr 
Thus,  if  point  P 3  is  used  to  approximate  point  P',  the  error  will  be: 

—  '~r~  ^an  v  +  ~~r~~  ■ 

\yi  /  \yt  /  14-  tan p  tan  v  dr  dr 

The  error  is  0  if  the  approximating  point  lies  exactly  “above”  P*  (differs  only  in  the 


'I 
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y-coordinate). 

Moreover,  one  can  use  as  an  estimation  ^  (the  first  two  terms),  so  that  some 

(possibly  extraretinal)  estimation  of  /x  (half  the  angle  of  convergence)  and  v  (the  angle  of 
gaze)  will  suffice  to  give  a  rough  estimation  to  when  no  other  source  of  information  is 
available.  Note  that  one  can  not  take  ^  ta  1  when  computing  xr  —  *^-x/,  as  a  first  order 
approximation  in  /x,  since  xr  —  x\  is  of  the  order  of  magnitude  of  y  also. 

Second,  let  us  consider  the  violation  of  the  basic  assumption,  namely,  that  the  Af-axes 
themselves  of  both  cameras  are  epipolar  lines.  This  introduces  an  error  Sr  and  St  in  the 
polar  angle  of  a  given  point’s  projections  on  the  right  and  left  images,  respectively.  Thus, 
the  true  orders  should  be: 

cot(i?r  +  6r ) 


A  = 


cot(i9t  +  St) 
cot  i9r  cot  i9, 


+ 


S,~ 


tan  $ i 


cot  i?/  '  cos2  1 9 1  ‘  sin2 

Xs  =  cot(dr  +  Sr)  -  cot(dj  +  St) 

1 


ST  +  o(6), 


=  (cot  t?r  —  cot  t h)  — 


■  Sr  + 


St  +  o(6), 


sin2  dr  r  sin2 

where  Sr,Si  <  S.  The  main  conclusion  from  this  is  that  the  effect  of  axes  misalignment 
is  greater  near  the  horizontal  and  vertical  meridians,  and  possibly  negligible  further  away. 
Also,  this  error  affects  less  the  expressions  for  qualitative  shape  (the  normal  to  iso-x  or  iso-y 
surface  contours),  since  they  involve  differences  where  this  error  is  somewhat  cancelled  out 
for  the  two  points. 


Discussion:  Comparison  to  some  empirical  data 

The  orders  A  and  x  as  defined  above,  and  the  way  the  scaling  coefficients  depend  on  camera 

parameters,  seem  to  be  consistent  with  the  following  psychophysical  results: 

1.  The  advantage  of  relative  depth  perception  in  human  vision,  which  is  more  reliable  than 
absolute  depth  perception.  That  is,  it  is  easier  to  distinguish  between  different  objects 
if  they  differ  a  little  in  depth  than  to  give  a  good  estimation  to  the  absolute  depth  of  a 
given  object  in  space  with  no  additional  information  of  perspective. 

2.  The  induced  effect,  as  discussed  above,  which  can  be  shown  to  be  a  natural  side  effect 
of  using  the  tilt-related  order  A  to  estimate  the  tilt  of  a  plane  at  the  center  of  gaze.  No 
assumptions  on  the  way  the  visual  system  finds  and  interprets  corresponding  points  is 
needed.  Moreover,  this  is  a  “local”  explanation  of  the  induced  effect  in  the  sense  that 
it  allows  for  opposite  induced  effects  in  neighboring  spatial  regions,  in  agreement  with 
psychophysical  evidence  (see  Rogers  &  Koenderink,  1986).  Likewise,  this  explanation 
does  not  imply  a  perceived  asymmetric  convergence  of  the  eyes,  again  in  agreement 
with  empirical  data.  It  is  interesting  to  note  that  A  might  be  the  discrete  equivalent  to 
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the  term  of  the  optical  flow  field  used  by  Rogers  &  Koenderink  (1986)  to  explain  the 
induced  effect  with  motion  parallax.  Quantitatively,  A  is  more  susceptible  to  errors  near 
the  meridians,  in  agreement  with  psychophysical  experiments  (Arditi,  1982)  that  show 
deterioration  in  human  performance  of  tilt  estimation  near  the  meridians.  Finally,  such 
an  effect  for  a  plane  not  passing  through  the  fixation  point  is  predicted  by  our  qualitative 
shape  analysis. 

3.  The  difficulty  in  comparing  right-hemifield  and  left-hemifield  points  for  their  depth,  as 
demonstrated  by  the  following  experiment  (Rogers  &  Graham,  1983):  given  two  planes, 
one  tilted  away  from  the  viewer  from  the  center  to  the  right  and  one  tilted  towards  the 
viewer  from  the  center  to  the  left,  whose  end  points  on  the  left  and  right  sides  respectively 
are  equidistant,  the  observer  will  (wrongly)  perceive  the  right  ride  of  the  first  plane  as 
closer  in  depth  than  the  left  side  of  the  second  plane.  If,  however,  the  whole  configuration 
is  rotated  by  90°  so  that  one  plane  is  tilted  upwards  from  the  center  and  the  other  is 
tilted  downwards  from  the  center,  with  far  from  the  center  end  sides  equidistant,  then 
the  observer  will  actually  (rightly)  perceive  those  two  ends  as  equidistant.  This  result  is 
consistent  with  the  above  order  A,  which  orders  points  on  the  right  and  left  hemifields 
separately,  on  either  side  of  the  horizontal  meridian,  but  not  if  they  are  on  different  sides 
of  the  vertical  meridian,  x  is  also  defined  differently  in  the  two  visual  hemifields,  and  it 
has  some  distortion  as  a  function  of  the  horizontal  distance  between  the  two  points. 

4.  Empirical  evidence  for  the  dependence  of  relative  depth  perception  on  extraretinal  per¬ 
ception  of  the  angle  of  convergence  of  the  eyes,  which  is  predicted  by  using  the  orders  A 
and  x  to  evaluate  depth  with  no  additional  computation. 

5.  The  anisotropy  between  the  horizontal  and  vertical  dimensions  in  relative  depth  per¬ 
ception,  as  demonstrated,  e.g.,  by  the  advantage  of  using  flanking  lines  as  test  objects 
versus  vertically  displaced  lines  (Westheimer,  1979).  Moreover,  both  vertically  displaced 
or  horizontally  displaced  stimuli  are  preferred  over  diagonally  displaced  stimuli  in  the 
sense  of  lower  disparity  threshold  for  discrimination.  This  is  consistent  with  two  distinct 
qualitative  orders  that  prefer  horizontal  or  vertical  displacement. 

6.  Psychophysical  results  suggesting  either  deterioration  in  depth  distinction  when  points 
are  coplanar  with  the  point  of  fixation  or  depth  perception  relative  to  a  plane  through  the 
fixation  point,  which  is  context  dependent  and  not  necessarily  the  front o-parallel  plane 
(see  Mitchison  &  Westheimer,  1984).  Note  that  A  is  constant  for  points  coplanar  with 
the  fixation  point. 

7.  The  use  of  A  for  threshold  estimation  in  acuity  experiments  predicts  some  results  like 
an  increase  of  threshold  with  standing  disparity  (Westheimer,  1979)  or  the  cosine  rule 
(Ebenholz  &  Walchli,  1965). 

Moreover,  A  and  xy  depend  only  on  the  polar  angles  of  the  conjugate  points  in  both 
images,  a  quantity  that  is  preserved  under  projection  to  a  spherical  body  (an  eye)  or  a 
planar  body  (a  camera).  It  is  interesting  to  note,  in  this  respect,  that  the  first  visual 
transformation  from  the  retina  to  FI  in  primates  seems  to  be  in  good  agreement  with 
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^  the  complex-log  mapping  (Schwartz,  1984),  namely:  (x,y)  — *  (logr,t?).  This  mapping 
jf*  explicitly  computes  the  polar  angle  t?  of  a  point. 


Summary 

The  goal  of  this  work  had  been  to  obtain  qualitative  information  from  a  stereo  pair,  with  as 
few  computations  as  possible  and  minimal  dependence  on  the  camera  and  scene  parameters. 
We  have  shown  that  points  in  a  stereo  pair,  once  matched  to  each  other,  can  be  ordered 
according  to  two  distinct  order  expressions:  a  tilt-related  order  A,  which  is  roughly  a  relative 
depth  order  when  ordering  points  with  only  vertical  displacement,  and  a  depth- related 
order  x  which  is  best  to  order  points  with  only  horizontal  displacement.  These  orders 
are  completely  determined  by  image  coordinates  of  conjugate  points,  and  no  camera  or 
scene  parameters  are  needed  (which  need  not  be  similar  for  both  cameras).  A  and  some 
variation  of  \  (xy)  depend  only  on  the  polar  angles  of  the  conjugate  points  in  both  images, 
a  quantity  which  is  preserved  under  projection  to  a  spherical  body  (an  eye)  or  a  planar  body 
(a  camera).  Moreover,  given  the  polar  angles  of  the  images  coordinates,  some  qualitative 
shape  information  can  be  obtained:  one  can  follow  changes  in  the  curvature  of  a  contour 
on  the  surface  of  any  object  in  the  field  of  view,  with  either  x-  or  y-coordinate  fixed.  We 
demonstrated,  by  further  analyzing  the  exact  equations,  that  obtaining  the  quantitative 
information  is  much  harder  and  less  reliable  than  obtaining  the  qualitative  one,  e.g.,  orders 
like  A  and  x-  It  usually  involves  some  assumptions  on  the  scene  or  extra- retinal  information, 
plus  a  lot  of  computations.  These  computations  tend  to  be  less  robust  and  sensitive  to 
noise  and  errors.  Finally,  we  discussed  some  psychophysical  (and  neurophysiological)  data 
which  seem  to  support  the  use  of  such  orders  in  humans.  Most  notably,  the  use  of  A  for 
tilt  estimation  predicts  “the  induced  size  effect”,  an  unusual  behavior  of  the  human  visual 
system  which  lacks  other  straightforward  explanation. 


Appendix 

Consider  the  base  plane,  which  includes  the  X-axes  of  both  cameras  and  both  their  optical- 
axes.  This  is  illustrated  in  figure  4,  where  0  is  the  focal  node  of  one  camera,  A  is  the 
fixation  point,  A'  is  the  projection  of  A  on  the  image  plane  or  the  origin  of  the  camera 
coordinate  system,  B  is  the  projection  of  a  given  point  in  space  ( C )  on  the  base  plane,  B' 
is  the  projection  of  B  on  the  camera  X-axis,  and  C'  is  the  projection  of  the  point  C  on 
the  image  plane.  In  the  base  plane,  we  add  the  point  D  which  is  the  projection  of  B  on 
the  optical  axis  AA! .  Let  <p  denote  the  angle  LB  AO.  Let  (x,y)  denote  the  coordinates  of 
the  projection  of  point  C  on  the  image,  so  that  x  =  A! B'  and  y  =  B'C' .  Let  d  denote  the 
distance  of  the  fixation  point  to  the  eye,  so  that  d  =  AO,  and  let  h  denote  the  focal  length 
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of  the  camera,  so  that  h  =  A'O.  Using  similar  triangles,  one  can  verify  the  following: 

BC  BO  __  DO  _  d  —  AB  cos  tp 

~  =  WU  =  TU  ~  h  ' 


C 

A 

d_AE5!^4b 


Figure  4.  The  base  plane  viewed  from  above,  with  one  camera. 
In  other  words, 

V  = 


hBU 


d  —  AB  cos  <p 

Using  the  same  arguments,  we  obtain 

AB  simp  DO  d  —  AB  cos  tp 

x  A'O  h 

In  other  words, 


x  = 


Thus 


hAB  sin<f 
d  —  AB  cos  tp 

AB  . 
sin 


(11) 


y  BO 

Note  that  our  assumption  that  the  base  plane  intersects  both  cameras’  X-axes  implies  that 
the  same  geomelry  holds  for  both  cameras  in  the  sense  that  the  segments  BC  and  AB  are 
identical  in  both  cases.  {A  and  C  are  the  same  points,  and  B  is  identical  since  C  is  projected 
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onto  the  same  plane.)  Let  us  add  indices  for  the  variables  of  the  left  and  right  cameras,  l 
and  r  respectively.  Then 

xT  AB 

Vr=W'Sm(Pr 


Finally 


xj  AB 
—  =  ==  •  sinu>;. 
Vi  BC  r 


Ir  jXf  _  sm<^r 

yr  /  yi  sin  tpt ' 


Figure  5.  Angles  of  tilt  on  the  base  plane  as  viewed  from  above. 

Figure  5  illustrates  the  geometry  on  the  base  plane  with  both  cameras,  and  the  coordinate 
system  used  in  the  text.  Recall  that  2//  denotes  the  angle  of  convergence  of  the  cameras  in 
the  base  plane,  and  that  the  Z-axis  is  defined  as  the  angle-bisector  of  this  angle.  Also,  a 
was  defined  as  arctan(  *■),  so  that  a  =  —  90°.  These  definitions  of  n  and  a  imply  the 

following: 

<pr  =  a  4-  90°  —  n 
<pt  =  a  - f-  90°  + 


.  W 


'-WN'iiV 
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Thus 


sin<^r  cos  a  cos  p  -f  sin  a  sin  /x  1  -f-  tan  or  tan  ^ 


sin<^;  cos  a  cos  /x  —  sin  a  sin  ft  1  —  tan  a  tan  fi 
We  have  defined  A  =  lt ^  =  /§f"  ^rom  (13)  and  (14)  we  obtain 


(14) 


A  = 


1  +  tan  a  tan  fi 


tana  tan/x  = 


A-  1 


A  +  1  ’ 


1  —  tan  a  tan  p, 
which  is  equivalent  to  equation  (1). 

Now,  if  C  =  ( x,y,z )  in  the  world  coordinate  system  we  have  defined  above,  then 


cot  dr  —  cot  t =  — - -  =  =(sin  ifir  —  sin  <x>i) 

Vr  yi  BC 


AB. 


BC 


2  sin  a  sin  /x 


=  2  sin  fi-  . 

y 


Since  by  definition  tan/?  =  we  immediately  obtain  equation  (2). 

Let  us  develop  the  expression  for  the  image  coordinate  y  above,  considering  the  right 
image  with  no  loss  of  generality.  We  then  have  from  (11), 

hBC 


yr  = 


dr  —  AB  cos  <p  r 
hy 


dr  —  x  sin  n  +  z  cos  /x 


From  figure  1,  in  which  the  angle  of  gaze  v  is  defined,  it  follows  that  dr 


/  cos( n~v) 


I  cos(/x  -  v)  .  yr 

y  =  ( - : - x  sm  /.i  +  z  cos  /x)  •  — . 

sin  2/x  h 


so  that 

(6) 


Applying  the  same  argument  to  the  y  coordinate  of  the  left  image,  we  obtain 


I  cos(p-fix)  .  yi 

y  ~  ( - r— v - -  +  xsin/x  +  ^  cos /x)  •  — . 

sm  2/x  h 


(6') 
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